PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cagra.6133s0035.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Capsella
Family Trihelix
Protein Properties Length: 683aa    MW: 75861.5 Da    PI: 5.9077
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cagra.6133s0035.1.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix97.21.4e-3060144187
             trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                          rW+++e+laL+++r++m++++r++ lk+plWe+vs+k+ e g++rs+k+Ckek+en++k+yk++ke++++r++++   +++f+qlea
  Cagra.6133s0035.1.p  60 RWPREETLALLRIRSDMDSTFRDATLKAPLWEHVSRKLLELGYKRSAKKCKEKFENVQKYYKRTKETRGGRHDGK--AYKFFSQLEA 144
                          8*********************************************************************86555..5******985 PP

2trihelix104.29.4e-33446530186
             trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqle 86 
                          rW+k e+laLi++r+ me r++++  k+ lWee+s  m++ g++r++k+Ckekwen+nk+ykk+ke++kkr +++ +tcpyf++l+
  Cagra.6133s0035.1.p 446 RWPKAEILALINLRSGMEPRYQDNVPKGLLWEEISTSMKRMGYNRNAKRCKEKWENINKYYKKVKESNKKR-PQDAKTCPYFHRLD 530
                          8*********************************************************************8.99999*******97 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS500906.86453117IPR017877Myb-like domain
SMARTSM007170.007657119IPR001005SANT/Myb domain
CDDcd122038.16E-2759124No hitNo description
PfamPF138371.1E-2059145No hitNo description
SMARTSM007170.011443505IPR001005SANT/Myb domain
CDDcd122036.93E-30445510No hitNo description
PROSITE profilePS500906.667445503IPR017877Myb-like domain
PfamPF138375.9E-22445530No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0008361Biological Processregulation of cell size
GO:0010090Biological Processtrichome morphogenesis
GO:0030308Biological Processnegative regulation of cell growth
GO:0032876Biological Processnegative regulation of DNA endoreduplication
GO:0042631Biological Processcellular response to water deprivation
GO:0045892Biological Processnegative regulation of transcription, DNA-templated
GO:2000037Biological Processregulation of stomatal complex patterning
GO:2000038Biological Processregulation of stomatal complex development
GO:0005634Cellular Componentnucleus
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 683 aa     Download sequence    Send to blast
MEQVGGGGGG NEVVEEASPI SSRPPASNNL EELMRFSAAA DDGGGGGGGG GSASSSSGNR  60
WPREETLALL RIRSDMDSTF RDATLKAPLW EHVSRKLLEL GYKRSAKKCK EKFENVQKYY  120
KRTKETRGGR HDGKAYKFFS QLEALNTTPT PLQPHPHPPS SSLDVTPLSV ANPILMPSSS  180
SSPFPIFSQP QPQTQPPQTH TVSFTPTPLP PPPPMAPTFP GVTFSSHSSS TASGMGSDDD  240
DDDDDMDVDQ ANIAGSSSRK RKRGNRGGGK MMELFEGLVR QVMQKQAAMQ RSFLEALEKR  300
EQERLDREEA WKRQEMSRLA REHEVMAQER AASASRDAAI ISLIQKITGH TIQLPPSLSS  360
QTPQVPPHQP PQPPPAAKRA HQEPQLSTAQ SQLQQPIMAI PQQQILPSPH LPHQPEQKQQ  420
QQQQQQQVQE MIVSSEQSSL LPSSSRWPKA EILALINLRS GMEPRYQDNV PKGLLWEEIS  480
TSMKRMGYNR NAKRCKEKWE NINKYYKKVK ESNKKRPQDA KTCPYFHRLD LLYRNKVLGG  540
SGGGGGSSTS GLPQEQKQSP VSAMKPPQEG LVNVQPHESG SSEEVEPIDQ ESTPQGTEKP  600
EDLVMRELMQ QQQQQQQESM IGEYEKIEES HNYNNMEEEE EEMDEEELDE EEKSAAFEIA  660
FQSPANRGGN GHTEPPFLTM VQ*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1102110KRSAKKCKE
Binding Motif ? help Back to Top
Motif ID Method Source Motif file
MP00177DAPTransfer from AT1G33240Download
Motif logo
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieveRetrieve
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAC0210450.0AC021045.2 Arabidopsis thaliana chromosome I BAC T9L6 genomic sequence, complete sequence.
GenBankAC0270350.0AC027035.5 Arabidopsis thaliana chromosome 1 BAC T16O9 genomic sequence, complete sequence.
GenBankAJ0032150.0AJ003215.1 Arabidopsis thaliana GTL1 gene.
GenBankCP0026840.0CP002684.1 Arabidopsis thaliana chromosome 1 sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_010461140.10.0PREDICTED: trihelix transcription factor GTL1-like isoform X3
SwissprotQ9C8820.0GTL1_ARATH; Trihelix transcription factor GTL1
TrEMBLR0IRM90.0R0IRM9_9BRAS; Uncharacterized protein
STRINGBra040010.1-P0.0(Brassica rapa)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM62262746
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G33240.10.0GT-2-like 1